USE OF A TEXT GRAMMAR FOR GENERATING HIGHLIGHT ABSTRACTS OF MAGAZINE ARTICLES MARIE-FRANCINE MOENS and JOS DUMORTIER
نویسندگان
چکیده
S OF MAGAZINE ARTICLES MARIE-FRANCINE MOENS and JOS DUMORTIER {marie-france.moens, jos.dumortier}@law.kuleuven.ac.be Interdisciplinary Centre for Law & IT (ICRI) , Katholieke Universiteit Leuven Tiensestraat 41, B-3000 Leuven, Belgium Browsing a database of article abstracts is one way to select and buy relevant magazine articles online. Our research contributes to the design and development of text grammars for abstracting texts in unlimited subject domains. We developed a system that parses texts based on the text grammar of a specic text type and that extracts sentences and statements which are relevant for inclusion in the abstracts. The system employs knowledge of the discourse patterns that are typical of news stories. The results are encouraging and demonstrate the importance of discourse structures in text summarisation.
منابع مشابه
Generic technologies for single- and multi-document summarization
The technologies for singleand multi-document summarization that are described and evaluated in this article can be used on heterogeneous texts for different summarization tasks. They refer to the extraction of important sentences from the documents, compressing the sentences to their essential or relevant content, and detecting redundant content across sentences. The technologies are tested at...
متن کاملApproaches to Text Mining Arguments from Legal Cases
This paper describes recent approaches using text-mining to automatically profile and extract arguments from legal cases. We outline some of the background context and motivations. We then turn to consider issues related to the construction and composition of a corpora of legal cases. We show how a Context-Free Grammar can be used to extract arguments, and how ontologies and Natural Language Pr...
متن کاملSentence Compression for Dutch Using Integer Linear Programming
Sentence compression is a valuable task in the framework of text summarization. In this paper we compress sentences from news articles taken from Dutch and Flemish newspapers using an integer linear programming approach. We rely on the Alpino parser available for Dutch and on the Latent Words Language Model. We demonstrate that the integer linear programming approach yields good results for com...
متن کاملInteger Linear Programming for Dutch Sentence Compression
Sentence compression is a valuable task in the framework of text summarization. In this paper we compress sentences from news articles from Dutch and Flemish newspapers written in Dutch using an integer linear programming approach. We rely on the Alpino parser available for Dutch and on the Latent Words Language Model. We demonstrate that the integer linear programming approach yields good resu...
متن کاملSummarizing Texts at Various Levels of Detail
Summarizing document texts at various levels of detail is required for many information selection tasks. For instance, when loading and visualizing documents on small screens of handheld devices, it is important to be able to dynamically compress texts. In this article we discuss a technique of generating hierarchical topic trees of a text and to use them in various ways to build summaries of a...
متن کامل